Gabor wavelet networks for object representation
Author
Abstract
The choice of an object representation is crucial for the effective performance of cognitive tasks such as object recognition, fixation, etc., because how robustly and efficiently vision tasks can be performed depends on the choice of the representation. In this work we introduce Gabor Wavelet Networks as an effective and efficient object representation. Gabor Wavelet Networks represent objects with sets of weighted Gabor wavelets that are specifically chosen to reflect the properties of the represented objects. The degrees of freedom of each Gabor wavelet are allowed to vary continuously. This is in contrast to the well-known bunch graph approach, also based on Gabor wavelets, where the wavelet parameters are chosen according to a specific discrete scheme that is based on the discrete wavelet transform. The optimized parameter choice of the Gabor Wavelet Networks allows the representation to be very sparse and specific to the represented objects. We will show experimentally that the specificity of the parameters can be exploited for the recognition of faces. Recognition rates are shown to be as high as 97%. The degrees of freedom of the wavelets allow any affine deformation that does not involve shearing. When shearing is added to the degrees of freedom, Gabor Wavelet Networks can easily be deformed affinely. This makes tracking applications very easy. Gabor Wavelet Networks represent objects through linear combinations of Gabor wavelets. Changing the dimensionality of the linear combination changes the complexity and precision of the representation. Computations based on the representation also vary in their complexity and precision. Controlling the dimensionality of the linear combinations used in vision tasks allows desired degrees of precision or speed to be achieved. This will be referred to as progressive attention. Affine variability and progressive attention will be tested in an affine real-time face tracking experiment.
The scalar weights in the linear combination of wavelets can be computed by applying each Gabor wavelet as a filter. The filter is applied to (projected onto) the image only at the position indicated by the wavelet parameters. The relation between the filter responses and the weights is linear, and the responses contain the same visual information as the weights. Therefore, the optimized Gabor wavelets of a network can be used not only for representation of an object but also for optimized filtering. We have exploited this in a head-pose estimation experiment. Our experiments have shown that the optimized filtering scheme is superior to a filtering scheme in which the filters are homogeneously distributed. The pose-estimation error was as low as 0.20°.
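The linear relation between filter responses and weights, and the idea of progressive attention, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes an odd Gabor wavelet of the form exp(-(u² + v²)/2)·sin(u) with a hypothetical parameterisation (centre, two scales, orientation), and it recovers the weights by solving the Gram system G w = r, where r holds the filter responses and G the inner products between wavelets. All function names and parameter values are illustrative assumptions.

```python
import numpy as np

def odd_gabor(shape, cx, cy, sx, sy, theta):
    """Sample an odd Gabor wavelet on a pixel grid (illustrative parameterisation)."""
    ys, xs = np.mgrid[0:shape[0], 0:shape[1]].astype(float)
    # Rotate and scale the coordinates relative to the wavelet centre.
    u = sx * (np.cos(theta) * (xs - cx) + np.sin(theta) * (ys - cy))
    v = sy * (-np.sin(theta) * (xs - cx) + np.cos(theta) * (ys - cy))
    return np.exp(-0.5 * (u**2 + v**2)) * np.sin(u)

def weights_from_responses(wavelets, image):
    """Project the image onto each wavelet, then solve G w = r for the weights."""
    psi = np.stack([w.ravel() for w in wavelets], axis=1)  # shape (pixels, N)
    responses = psi.T @ image.ravel()  # filter responses r_i = <psi_i, image>
    gram = psi.T @ psi                 # G_ij = <psi_i, psi_j>
    weights = np.linalg.solve(gram, responses)
    return responses, weights

def reconstruct(wavelets, weights):
    """Linear combination of the wavelets with the given weights."""
    return sum(w * p for w, p in zip(weights, wavelets))

def progressive_error(wavelets, image, m):
    """Residual norm when only the first m wavelets are used (weights refitted)."""
    _, w = weights_from_responses(wavelets[:m], image)
    return float(np.linalg.norm(image - reconstruct(wavelets[:m], w)))

# Hypothetical network of four wavelets on a 32x32 grid.
shape = (32, 32)
params = [
    (10, 10, 0.30, 0.30, 0.0),
    (20, 12, 0.40, 0.20, 0.8),
    (16, 22, 0.25, 0.35, 1.6),
    (8, 24, 0.50, 0.30, 2.4),
]
wavelets = [odd_gabor(shape, *p) for p in params]
true_weights = np.array([1.0, -0.5, 2.0, 0.7])
image = reconstruct(wavelets, true_weights)

responses, weights = weights_from_responses(wavelets, image)
errors = [progressive_error(wavelets, image, m) for m in range(1, 5)]
```

In this sketch, fewer wavelets mean a cheaper but coarser representation. Because the weights are refitted for each subset, the residual cannot increase as more wavelets are added. This is one simple reading of progressive attention, and the ordering of wavelets here is arbitrary.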
Similar resources
Efficient Head Pose Estimation with Gabor Wavelet Networks
In this article we want to introduce first the Gabor wavelet network as a model based approach for an effective and efficient object representation. The Gabor wavelet network has several advantages such as invariance to some degree with respect to translation, rotation and dilation. Furthermore, the use of Gabor filters ensured that geometrical and textural object features are encoded. The feas...
Gabor wavelet networks for efficient head pose estimation
In this paper we first introduce the Gabor Wavelet Network (GWN) as a model-based approach for effective and efficient object representation. GWNs combine the advantages of the continuous wavelet transform with RBF networks. They have additional advantages such as invariance to some degree with respect to affine deformations. The use of Gabor filters enables the coding of geometrical and textur...
INSTITUT FÜR INFORMATIK UND PRAKTISCHE MATHEMATIK Gabor Wavelet Networks for Object Representation
Gabor wavelet representation for 3-D object recognition
This paper presents a model-based object recognition approach that uses a Gabor wavelet representation. The key idea is to use magnitude, phase, and frequency measures of the Gabor wavelet representation in an innovative flexible matching approach that can provide robust recognition. The Gabor grid, a topology-preserving map, efficiently encodes both signal energy and structural information of ...
Gabor Wavelets for 3-D Object Recognition
This paper presents a model-based object recognition approach that uses a hierarchical Gabor wavelet representation. The key idea is to use magnitude, phase and frequency measures of Gabor wavelet representation in an innovative flexible matching approach that can provide robust recognition. A Gabor grid, a topology-preserving map, efficiently encodes both signal energy and structural informat...